Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 3897 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 410 |
| Duplicate rows (%) | 10.5% |
| Total size in memory | 395.9 KiB |
| Average record size in memory | 104.0 B |
Variable types
| Numeric | 12 |
|---|---|
| Categorical | 1 |
| Dataset has 410 (10.5%) duplicate rows | Duplicates |
fixed acidity is highly overall correlated with wine_type | High correlation |
volatile acidity is highly overall correlated with wine_type | High correlation |
residual sugar is highly overall correlated with density | High correlation |
chlorides is highly overall correlated with density and 1 other fields | High correlation |
free sulfur dioxide is highly overall correlated with total sulfur dioxide and 1 other fields | High correlation |
total sulfur dioxide is highly overall correlated with free sulfur dioxide and 1 other fields | High correlation |
density is highly overall correlated with residual sugar and 2 other fields | High correlation |
alcohol is highly overall correlated with density | High correlation |
wine_type is highly overall correlated with fixed acidity and 4 other fields | High correlation |
citric acid has 93 (2.4%) zeros | Zeros |
Reproduction
| Analysis started | 2023-10-28 11:17:18.343561 |
|---|---|
| Analysis finished | 2023-10-28 11:17:56.688096 |
| Duration | 38.34 seconds |
| Software version | ydata-profiling vv4.6.0 |
| Download configuration | config.json |
fixed acidity
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 100 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.2276751 |
| Minimum | 3.8 |
|---|---|
| Maximum | 15.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 3.8 |
|---|---|
| 5-th percentile | 5.7 |
| Q1 | 6.4 |
| median | 7 |
| Q3 | 7.7 |
| 95-th percentile | 9.9 |
| Maximum | 15.9 |
| Range | 12.1 |
| Interquartile range (IQR) | 1.3 |
Descriptive statistics
| Standard deviation | 1.3317322 |
|---|---|
| Coefficient of variation (CV) | 0.18425458 |
| Kurtosis | 4.9110989 |
| Mean | 7.2276751 |
| Median Absolute Deviation (MAD) | 0.6 |
| Skewness | 1.7471779 |
| Sum | 28166.25 |
| Variance | 1.7735107 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.8 | 217 | 5.6% |
| 6.6 | 214 | 5.5% |
| 6.4 | 180 | 4.6% |
| 7.2 | 164 | 4.2% |
| 7 | 161 | 4.1% |
| 6.9 | 160 | 4.1% |
| 6.7 | 159 | 4.1% |
| 7.1 | 155 | 4.0% |
| 7.3 | 147 | 3.8% |
| 7.4 | 145 | 3.7% |
| Other values (90) | 2195 |
| Value | Count | Frequency (%) |
| 3.8 | 1 | < 0.1% |
| 4.2 | 1 | < 0.1% |
| 4.4 | 2 | 0.1% |
| 4.5 | 1 | < 0.1% |
| 4.7 | 4 | 0.1% |
| 4.8 | 5 | 0.1% |
| 4.9 | 3 | 0.1% |
| 5 | 20 | |
| 5.1 | 19 | |
| 5.2 | 23 |
| Value | Count | Frequency (%) |
| 15.9 | 1 | |
| 15.6 | 1 | |
| 15.5 | 1 | |
| 15 | 1 | |
| 14.3 | 1 | |
| 13.8 | 1 | |
| 13.7 | 2 | |
| 13.5 | 1 | |
| 13.4 | 1 | |
| 13.3 | 2 |
volatile acidity
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 171 |
|---|---|
| Distinct (%) | 4.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.34235694 |
| Minimum | 0.08 |
|---|---|
| Maximum | 1.58 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 0.08 |
|---|---|
| 5-th percentile | 0.16 |
| Q1 | 0.23 |
| median | 0.3 |
| Q3 | 0.41 |
| 95-th percentile | 0.67 |
| Maximum | 1.58 |
| Range | 1.5 |
| Interquartile range (IQR) | 0.18 |
Descriptive statistics
| Standard deviation | 0.1656422 |
|---|---|
| Coefficient of variation (CV) | 0.48382897 |
| Kurtosis | 3.4078917 |
| Mean | 0.34235694 |
| Median Absolute Deviation (MAD) | 0.08 |
| Skewness | 1.5653356 |
| Sum | 1334.165 |
| Variance | 0.02743734 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.28 | 177 | 4.5% |
| 0.24 | 162 | 4.2% |
| 0.26 | 156 | 4.0% |
| 0.27 | 150 | 3.8% |
| 0.22 | 146 | 3.7% |
| 0.32 | 133 | 3.4% |
| 0.3 | 131 | 3.4% |
| 0.23 | 128 | 3.3% |
| 0.25 | 126 | 3.2% |
| 0.2 | 118 | 3.0% |
| Other values (161) | 2470 |
| Value | Count | Frequency (%) |
| 0.08 | 2 | 0.1% |
| 0.1 | 3 | 0.1% |
| 0.105 | 5 | 0.1% |
| 0.11 | 10 | 0.3% |
| 0.115 | 2 | 0.1% |
| 0.12 | 18 | |
| 0.125 | 1 | < 0.1% |
| 0.13 | 25 | |
| 0.135 | 1 | < 0.1% |
| 0.14 | 41 |
| Value | Count | Frequency (%) |
| 1.58 | 1 | < 0.1% |
| 1.33 | 2 | |
| 1.24 | 1 | < 0.1% |
| 1.18 | 1 | < 0.1% |
| 1.13 | 1 | < 0.1% |
| 1.1 | 1 | < 0.1% |
| 1.09 | 1 | < 0.1% |
| 1.07 | 1 | < 0.1% |
| 1.04 | 3 | |
| 1.025 | 1 | < 0.1% |
citric acid
Real number (ℝ)
ZEROS 
| Distinct | 85 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.32001026 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 93 |
| Zeros (%) | 2.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.05 |
| Q1 | 0.25 |
| median | 0.31 |
| Q3 | 0.4 |
| 95-th percentile | 0.56 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.15 |
Descriptive statistics
| Standard deviation | 0.14623562 |
|---|---|
| Coefficient of variation (CV) | 0.45697165 |
| Kurtosis | 1.1568587 |
| Mean | 0.32001026 |
| Median Absolute Deviation (MAD) | 0.07 |
| Skewness | 0.34229552 |
| Sum | 1247.08 |
| Variance | 0.021384856 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.3 | 195 | 5.0% |
| 0.28 | 184 | 4.7% |
| 0.49 | 176 | 4.5% |
| 0.32 | 169 | 4.3% |
| 0.34 | 162 | 4.2% |
| 0.26 | 152 | 3.9% |
| 0.29 | 147 | 3.8% |
| 0.24 | 133 | 3.4% |
| 0.31 | 133 | 3.4% |
| 0.27 | 127 | 3.3% |
| Other values (75) | 2319 |
| Value | Count | Frequency (%) |
| 0 | 93 | |
| 0.01 | 17 | 0.4% |
| 0.02 | 39 | |
| 0.03 | 16 | 0.4% |
| 0.04 | 26 | 0.7% |
| 0.05 | 16 | 0.4% |
| 0.06 | 22 | 0.6% |
| 0.07 | 27 | 0.7% |
| 0.08 | 18 | 0.5% |
| 0.09 | 27 | 0.7% |
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 0.91 | 1 | < 0.1% |
| 0.86 | 1 | < 0.1% |
| 0.82 | 1 | < 0.1% |
| 0.81 | 2 | 0.1% |
| 0.8 | 2 | 0.1% |
| 0.79 | 3 | |
| 0.78 | 1 | < 0.1% |
| 0.76 | 2 | 0.1% |
| 0.75 | 1 | < 0.1% |
residual sugar
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 280 |
|---|---|
| Distinct (%) | 7.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.3817937 |
| Minimum | 0.7 |
|---|---|
| Maximum | 26.05 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 0.7 |
|---|---|
| 5-th percentile | 1.2 |
| Q1 | 1.8 |
| median | 3 |
| Q3 | 8 |
| 95-th percentile | 15 |
| Maximum | 26.05 |
| Range | 25.35 |
| Interquartile range (IQR) | 6.2 |
Descriptive statistics
| Standard deviation | 4.6489006 |
|---|---|
| Coefficient of variation (CV) | 0.86381992 |
| Kurtosis | 0.43803368 |
| Mean | 5.3817937 |
| Median Absolute Deviation (MAD) | 1.7 |
| Skewness | 1.166878 |
| Sum | 20972.85 |
| Variance | 21.612277 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 146 | 3.7% |
| 1.8 | 144 | 3.7% |
| 1.4 | 133 | 3.4% |
| 1.6 | 122 | 3.1% |
| 1.2 | 118 | 3.0% |
| 2.2 | 116 | 3.0% |
| 1.7 | 111 | 2.8% |
| 1.5 | 107 | 2.7% |
| 2.1 | 99 | 2.5% |
| 1.9 | 95 | 2.4% |
| Other values (270) | 2706 |
| Value | Count | Frequency (%) |
| 0.7 | 3 | 0.1% |
| 0.8 | 17 | 0.4% |
| 0.9 | 29 | 0.7% |
| 0.95 | 3 | 0.1% |
| 1 | 46 | 1.2% |
| 1.05 | 1 | < 0.1% |
| 1.1 | 92 | |
| 1.15 | 2 | 0.1% |
| 1.2 | 118 | |
| 1.25 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 26.05 | 2 | |
| 22 | 2 | |
| 20.8 | 2 | |
| 20.3 | 1 | < 0.1% |
| 20.15 | 1 | < 0.1% |
| 19.95 | 3 | |
| 19.9 | 1 | < 0.1% |
| 19.8 | 1 | < 0.1% |
| 19.6 | 1 | < 0.1% |
| 19.5 | 2 |
chlorides
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 181 |
|---|---|
| Distinct (%) | 4.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.056255068 |
| Minimum | 0.009 |
|---|---|
| Maximum | 0.611 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 0.009 |
|---|---|
| 5-th percentile | 0.028 |
| Q1 | 0.038 |
| median | 0.047 |
| Q3 | 0.066 |
| 95-th percentile | 0.102 |
| Maximum | 0.611 |
| Range | 0.602 |
| Interquartile range (IQR) | 0.028 |
Descriptive statistics
| Standard deviation | 0.035657042 |
|---|---|
| Coefficient of variation (CV) | 0.63384586 |
| Kurtosis | 56.34578 |
| Mean | 0.056255068 |
| Median Absolute Deviation (MAD) | 0.011 |
| Skewness | 5.6651288 |
| Sum | 219.226 |
| Variance | 0.0012714247 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.044 | 134 | 3.4% |
| 0.042 | 116 | 3.0% |
| 0.034 | 114 | 2.9% |
| 0.048 | 113 | 2.9% |
| 0.036 | 112 | 2.9% |
| 0.05 | 112 | 2.9% |
| 0.045 | 112 | 2.9% |
| 0.04 | 109 | 2.8% |
| 0.038 | 105 | 2.7% |
| 0.047 | 101 | 2.6% |
| Other values (171) | 2769 |
| Value | Count | Frequency (%) |
| 0.009 | 1 | < 0.1% |
| 0.012 | 1 | < 0.1% |
| 0.014 | 3 | 0.1% |
| 0.015 | 3 | 0.1% |
| 0.016 | 4 | |
| 0.017 | 2 | 0.1% |
| 0.018 | 6 | |
| 0.019 | 3 | 0.1% |
| 0.02 | 8 | |
| 0.021 | 9 |
| Value | Count | Frequency (%) |
| 0.611 | 1 | |
| 0.61 | 1 | |
| 0.422 | 1 | |
| 0.415 | 1 | |
| 0.414 | 2 | |
| 0.413 | 1 | |
| 0.403 | 1 | |
| 0.387 | 1 | |
| 0.368 | 1 | |
| 0.36 | 1 |
free sulfur dioxide
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 118 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.56582 |
| Minimum | 2 |
|---|---|
| Maximum | 146.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 17 |
| median | 29 |
| Q3 | 41 |
| 95-th percentile | 61 |
| Maximum | 146.5 |
| Range | 144.5 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 17.828001 |
|---|---|
| Coefficient of variation (CV) | 0.58326592 |
| Kurtosis | 1.7436102 |
| Mean | 30.56582 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 0.89308059 |
| Sum | 119115 |
| Variance | 317.83762 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 29 | 107 | 2.7% |
| 6 | 104 | 2.7% |
| 15 | 103 | 2.6% |
| 31 | 95 | 2.4% |
| 24 | 94 | 2.4% |
| 27 | 90 | 2.3% |
| 26 | 89 | 2.3% |
| 34 | 89 | 2.3% |
| 35 | 87 | 2.2% |
| 28 | 85 | 2.2% |
| Other values (108) | 2954 |
| Value | Count | Frequency (%) |
| 2 | 1 | < 0.1% |
| 3 | 39 | 1.0% |
| 4 | 30 | 0.8% |
| 5 | 84 | |
| 6 | 104 | |
| 7 | 63 | |
| 8 | 46 | |
| 9 | 52 | |
| 10 | 83 | |
| 11 | 62 |
| Value | Count | Frequency (%) |
| 146.5 | 1 | |
| 138.5 | 1 | |
| 131 | 1 | |
| 128 | 1 | |
| 124 | 1 | |
| 122.5 | 1 | |
| 118.5 | 1 | |
| 108 | 2 | |
| 105 | 1 | |
| 101 | 2 |
total sulfur dioxide
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 265 |
|---|---|
| Distinct (%) | 6.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 115.24942 |
| Minimum | 6 |
|---|---|
| Maximum | 366.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 19 |
| Q1 | 77 |
| median | 118 |
| Q3 | 155 |
| 95-th percentile | 207 |
| Maximum | 366.5 |
| Range | 360.5 |
| Interquartile range (IQR) | 78 |
Descriptive statistics
| Standard deviation | 56.764684 |
|---|---|
| Coefficient of variation (CV) | 0.49253769 |
| Kurtosis | -0.45827173 |
| Mean | 115.24942 |
| Median Absolute Deviation (MAD) | 38 |
| Skewness | -0.0024354061 |
| Sum | 449127 |
| Variance | 3222.2294 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 111 | 42 | 1.1% |
| 126 | 38 | 1.0% |
| 98 | 38 | 1.0% |
| 125 | 36 | 0.9% |
| 124 | 36 | 0.9% |
| 122 | 36 | 0.9% |
| 87 | 36 | 0.9% |
| 119 | 35 | 0.9% |
| 116 | 35 | 0.9% |
| 118 | 35 | 0.9% |
| Other values (255) | 3530 |
| Value | Count | Frequency (%) |
| 6 | 3 | 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 10 | |
| 9 | 11 | |
| 10 | 12 | |
| 11 | 15 | |
| 12 | 20 | |
| 13 | 20 | |
| 14 | 23 | |
| 15 | 21 |
| Value | Count | Frequency (%) |
| 366.5 | 1 | |
| 344 | 1 | |
| 313 | 1 | |
| 307.5 | 1 | |
| 289 | 1 | |
| 282 | 1 | |
| 278 | 1 | |
| 272 | 2 | |
| 260 | 1 | |
| 256 | 1 |
density
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 871 |
|---|---|
| Distinct (%) | 22.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.99469028 |
| Minimum | 0.98711 |
|---|---|
| Maximum | 1.00315 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 0.98711 |
|---|---|
| 5-th percentile | 0.989924 |
| Q1 | 0.99228 |
| median | 0.9949 |
| Q3 | 0.99694 |
| 95-th percentile | 0.9993 |
| Maximum | 1.00315 |
| Range | 0.01604 |
| Interquartile range (IQR) | 0.00466 |
Descriptive statistics
| Standard deviation | 0.0029469664 |
|---|---|
| Coefficient of variation (CV) | 0.0029626975 |
| Kurtosis | -0.76983279 |
| Mean | 0.99469028 |
| Median Absolute Deviation (MAD) | 0.0023 |
| Skewness | -0.034972275 |
| Sum | 3876.308 |
| Variance | 8.6846108 × 10-6 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.9976 | 44 | 1.1% |
| 0.9986 | 41 | 1.1% |
| 0.9972 | 40 | 1.0% |
| 0.9944 | 39 | 1.0% |
| 0.992 | 36 | 0.9% |
| 0.9958 | 36 | 0.9% |
| 0.9968 | 35 | 0.9% |
| 0.9932 | 34 | 0.9% |
| 0.9984 | 34 | 0.9% |
| 0.998 | 34 | 0.9% |
| Other values (861) | 3524 |
| Value | Count | Frequency (%) |
| 0.98711 | 1 | |
| 0.98722 | 1 | |
| 0.9874 | 1 | |
| 0.98742 | 1 | |
| 0.98746 | 1 | |
| 0.98774 | 1 | |
| 0.98779 | 1 | |
| 0.98794 | 1 | |
| 0.98815 | 1 | |
| 0.98816 | 1 |
| Value | Count | Frequency (%) |
| 1.00315 | 2 | |
| 1.00295 | 2 | |
| 1.00289 | 1 | |
| 1.0026 | 2 | |
| 1.00242 | 2 | |
| 1.00241 | 1 | |
| 1.0022 | 2 | |
| 1.0021 | 1 | |
| 1.00196 | 1 | |
| 1.0018 | 1 |
pH
Real number (ℝ)
| Distinct | 103 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.2170131 |
| Minimum | 2.74 |
|---|---|
| Maximum | 4.01 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 2.74 |
|---|---|
| 5-th percentile | 2.96 |
| Q1 | 3.1 |
| median | 3.2 |
| Q3 | 3.32 |
| 95-th percentile | 3.5 |
| Maximum | 4.01 |
| Range | 1.27 |
| Interquartile range (IQR) | 0.22 |
Descriptive statistics
| Standard deviation | 0.16102424 |
|---|---|
| Coefficient of variation (CV) | 0.05005396 |
| Kurtosis | 0.31248253 |
| Mean | 3.2170131 |
| Median Absolute Deviation (MAD) | 0.11 |
| Skewness | 0.37401536 |
| Sum | 12536.7 |
| Variance | 0.025928807 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3.16 | 118 | 3.0% |
| 3.14 | 113 | 2.9% |
| 3.22 | 109 | 2.8% |
| 3.2 | 108 | 2.8% |
| 3.19 | 108 | 2.8% |
| 3.15 | 101 | 2.6% |
| 3.08 | 98 | 2.5% |
| 3.3 | 97 | 2.5% |
| 3.1 | 97 | 2.5% |
| 3.24 | 96 | 2.5% |
| Other values (93) | 2852 |
| Value | Count | Frequency (%) |
| 2.74 | 2 | 0.1% |
| 2.79 | 1 | < 0.1% |
| 2.8 | 2 | 0.1% |
| 2.82 | 1 | < 0.1% |
| 2.83 | 3 | 0.1% |
| 2.84 | 1 | < 0.1% |
| 2.85 | 7 | |
| 2.86 | 3 | 0.1% |
| 2.87 | 7 | |
| 2.88 | 8 |
| Value | Count | Frequency (%) |
| 4.01 | 1 | < 0.1% |
| 3.9 | 1 | < 0.1% |
| 3.85 | 1 | < 0.1% |
| 3.82 | 1 | < 0.1% |
| 3.81 | 1 | < 0.1% |
| 3.8 | 1 | < 0.1% |
| 3.78 | 1 | < 0.1% |
| 3.76 | 2 | |
| 3.75 | 3 | |
| 3.74 | 1 | < 0.1% |
sulphates
Real number (ℝ)
| Distinct | 103 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.53222222 |
| Minimum | 0.22 |
|---|---|
| Maximum | 2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 0.22 |
|---|---|
| 5-th percentile | 0.35 |
| Q1 | 0.43 |
| median | 0.51 |
| Q3 | 0.6 |
| 95-th percentile | 0.8 |
| Maximum | 2 |
| Range | 1.78 |
| Interquartile range (IQR) | 0.17 |
Descriptive statistics
| Standard deviation | 0.1504541 |
|---|---|
| Coefficient of variation (CV) | 0.28269038 |
| Kurtosis | 8.4198238 |
| Mean | 0.53222222 |
| Median Absolute Deviation (MAD) | 0.08 |
| Skewness | 1.8115237 |
| Sum | 2074.07 |
| Variance | 0.022636436 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.5 | 161 | 4.1% |
| 0.54 | 140 | 3.6% |
| 0.46 | 138 | 3.5% |
| 0.44 | 135 | 3.5% |
| 0.48 | 131 | 3.4% |
| 0.38 | 127 | 3.3% |
| 0.52 | 120 | 3.1% |
| 0.53 | 119 | 3.1% |
| 0.49 | 118 | 3.0% |
| 0.45 | 116 | 3.0% |
| Other values (93) | 2592 |
| Value | Count | Frequency (%) |
| 0.22 | 1 | < 0.1% |
| 0.23 | 1 | < 0.1% |
| 0.25 | 3 | 0.1% |
| 0.26 | 3 | 0.1% |
| 0.27 | 6 | 0.2% |
| 0.28 | 6 | 0.2% |
| 0.29 | 13 | |
| 0.3 | 24 | |
| 0.31 | 18 | |
| 0.32 | 29 |
| Value | Count | Frequency (%) |
| 2 | 1 | < 0.1% |
| 1.98 | 1 | < 0.1% |
| 1.62 | 1 | < 0.1% |
| 1.61 | 1 | < 0.1% |
| 1.59 | 1 | < 0.1% |
| 1.36 | 3 | |
| 1.34 | 1 | < 0.1% |
| 1.33 | 1 | < 0.1% |
| 1.28 | 1 | < 0.1% |
| 1.26 | 1 | < 0.1% |
alcohol
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 92 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.48646 |
| Minimum | 8 |
|---|---|
| Maximum | 14.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 8 |
|---|---|
| 5-th percentile | 8.9 |
| Q1 | 9.5 |
| median | 10.3 |
| Q3 | 11.3 |
| 95-th percentile | 12.7 |
| Maximum | 14.9 |
| Range | 6.9 |
| Interquartile range (IQR) | 1.8 |
Descriptive statistics
| Standard deviation | 1.1985447 |
|---|---|
| Coefficient of variation (CV) | 0.11429451 |
| Kurtosis | -0.51830535 |
| Mean | 10.48646 |
| Median Absolute Deviation (MAD) | 0.9 |
| Skewness | 0.57474188 |
| Sum | 40865.733 |
| Variance | 1.4365095 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9.5 | 220 | 5.6% |
| 9.4 | 202 | 5.2% |
| 9.2 | 167 | 4.3% |
| 10 | 145 | 3.7% |
| 9.8 | 134 | 3.4% |
| 9 | 133 | 3.4% |
| 11 | 132 | 3.4% |
| 10.5 | 132 | 3.4% |
| 10.2 | 124 | 3.2% |
| 10.4 | 115 | 3.0% |
| Other values (82) | 2393 |
| Value | Count | Frequency (%) |
| 8 | 2 | 0.1% |
| 8.4 | 3 | 0.1% |
| 8.5 | 7 | 0.2% |
| 8.6 | 15 | 0.4% |
| 8.7 | 58 | 1.5% |
| 8.8 | 66 | 1.7% |
| 8.9 | 46 | 1.2% |
| 9 | 133 | |
| 9.1 | 98 | |
| 9.2 | 167 |
| Value | Count | Frequency (%) |
| 14.9 | 1 | < 0.1% |
| 14.2 | 1 | < 0.1% |
| 14 | 7 | |
| 13.9 | 2 | 0.1% |
| 13.8 | 1 | < 0.1% |
| 13.7 | 5 | 0.1% |
| 13.6 | 8 | |
| 13.5 | 7 | |
| 13.4 | 16 | |
| 13.3 | 5 | 0.1% |
quality
Real number (ℝ)
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.8196048 |
| Minimum | 3 |
|---|---|
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.6 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 5 |
| median | 6 |
| Q3 | 6 |
| 95-th percentile | 7 |
| Maximum | 9 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.879288 |
|---|---|
| Coefficient of variation (CV) | 0.15109067 |
| Kurtosis | 0.19994369 |
| Mean | 5.8196048 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.23421838 |
| Sum | 22679 |
| Variance | 0.77314738 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 1681 | |
| 5 | 1294 | |
| 7 | 646 | 16.6% |
| 4 | 134 | 3.4% |
| 8 | 123 | 3.2% |
| 3 | 15 | 0.4% |
| 9 | 4 | 0.1% |
| Value | Count | Frequency (%) |
| 3 | 15 | 0.4% |
| 4 | 134 | 3.4% |
| 5 | 1294 | |
| 6 | 1681 | |
| 7 | 646 | 16.6% |
| 8 | 123 | 3.2% |
| 9 | 4 | 0.1% |
| Value | Count | Frequency (%) |
| 9 | 4 | 0.1% |
| 8 | 123 | 3.2% |
| 7 | 646 | 16.6% |
| 6 | 1681 | |
| 5 | 1294 | |
| 4 | 134 | 3.4% |
| 3 | 15 | 0.4% |
wine_type
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.6 KiB |
| white | |
|---|---|
| red |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.5001283 |
| Min length | 3 |
Characters and Unicode
| Total characters | 17537 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | white |
|---|---|
| 2nd row | red |
| 3rd row | red |
| 4th row | white |
| 5th row | white |
Common Values
| Value | Count | Frequency (%) |
| white | 2923 | |
| red | 974 | 25.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| white | 2923 | |
| red | 974 | 25.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3897 | |
| w | 2923 | |
| h | 2923 | |
| i | 2923 | |
| t | 2923 | |
| r | 974 | 5.6% |
| d | 974 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17537 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3897 | |
| w | 2923 | |
| h | 2923 | |
| i | 2923 | |
| t | 2923 | |
| r | 974 | 5.6% |
| d | 974 | 5.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17537 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3897 | |
| w | 2923 | |
| h | 2923 | |
| i | 2923 | |
| t | 2923 | |
| r | 974 | 5.6% |
| d | 974 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17537 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 3897 | |
| w | 2923 | |
| h | 2923 | |
| i | 2923 | |
| t | 2923 | |
| r | 974 | 5.6% |
| d | 974 | 5.6% |
| fixed acidity | volatile acidity | citric acid | residual sugar | chlorides | free sulfur dioxide | total sulfur dioxide | density | pH | sulphates | alcohol | quality | wine_type | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| fixed acidity | 1.000 | 0.186 | 0.284 | -0.036 | 0.354 | -0.261 | -0.239 | 0.433 | -0.246 | 0.218 | -0.105 | -0.090 | 0.514 |
| volatile acidity | 0.186 | 1.000 | -0.296 | -0.068 | 0.414 | -0.377 | -0.354 | 0.257 | 0.219 | 0.247 | -0.013 | -0.269 | 0.654 |
| citric acid | 0.284 | -0.296 | 1.000 | 0.075 | -0.053 | 0.133 | 0.160 | 0.083 | -0.286 | 0.044 | 0.009 | 0.104 | 0.439 |
| residual sugar | -0.036 | -0.068 | 0.075 | 1.000 | -0.035 | 0.361 | 0.451 | 0.523 | -0.224 | -0.132 | -0.335 | -0.020 | 0.427 |
| chlorides | 0.354 | 0.414 | -0.053 | -0.035 | 1.000 | -0.271 | -0.286 | 0.593 | 0.171 | 0.364 | -0.396 | -0.300 | 0.766 |
| free sulfur dioxide | -0.261 | -0.377 | 0.133 | 0.361 | -0.271 | 1.000 | 0.742 | -0.019 | -0.169 | -0.220 | -0.181 | 0.081 | 0.543 |
| total sulfur dioxide | -0.239 | -0.354 | 0.160 | 0.451 | -0.286 | 0.742 | 1.000 | 0.044 | -0.247 | -0.259 | -0.307 | -0.063 | 0.809 |
| density | 0.433 | 0.257 | 0.083 | 0.523 | 0.593 | -0.019 | 0.044 | 1.000 | 0.022 | 0.274 | -0.692 | -0.322 | 0.435 |
| pH | -0.246 | 0.219 | -0.286 | -0.224 | 0.171 | -0.169 | -0.247 | 0.022 | 1.000 | 0.260 | 0.137 | 0.040 | 0.345 |
| sulphates | 0.218 | 0.247 | 0.044 | -0.132 | 0.364 | -0.220 | -0.259 | 0.274 | 0.260 | 1.000 | 0.022 | 0.038 | 0.477 |
| alcohol | -0.105 | -0.013 | 0.009 | -0.335 | -0.396 | -0.181 | -0.307 | -0.692 | 0.137 | 0.022 | 1.000 | 0.461 | 0.145 |
| quality | -0.090 | -0.269 | 0.104 | -0.020 | -0.300 | 0.081 | -0.063 | -0.322 | 0.040 | 0.038 | 0.461 | 1.000 | 0.128 |
| wine_type | 0.514 | 0.654 | 0.439 | 0.427 | 0.766 | 0.543 | 0.809 | 0.435 | 0.345 | 0.477 | 0.145 | 0.128 | 1.000 |
| fixed acidity | volatile acidity | citric acid | residual sugar | chlorides | free sulfur dioxide | total sulfur dioxide | density | pH | sulphates | alcohol | quality | wine_type | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 9.5 | 0.420 | 0.41 | 2.3 | 0.034 | 22.0 | 145.0 | 0.99510 | 3.06 | 0.52 | 11.0 | 6.0 | white |
| 1 | 7.6 | 0.665 | 0.10 | 1.5 | 0.066 | 27.0 | 55.0 | 0.99655 | 3.39 | 0.51 | 9.3 | 5.0 | red |
| 2 | 8.5 | 0.280 | 0.35 | 1.7 | 0.061 | 6.0 | 15.0 | 0.99524 | 3.30 | 0.74 | 11.8 | 7.0 | red |
| 3 | 6.1 | 0.200 | 0.40 | 1.9 | 0.028 | 32.0 | 138.0 | 0.99140 | 3.26 | 0.72 | 11.7 | 5.0 | white |
| 4 | 6.4 | 0.280 | 0.44 | 7.1 | 0.048 | 49.0 | 179.0 | 0.99528 | 3.15 | 0.48 | 9.2 | 5.0 | white |
| 5 | 5.7 | 0.270 | 0.16 | 9.0 | 0.053 | 32.0 | 111.0 | 0.99474 | 3.36 | 0.37 | 10.4 | 6.0 | white |
| 6 | 7.4 | 0.160 | 0.27 | 15.5 | 0.050 | 25.0 | 135.0 | 0.99840 | 2.90 | 0.43 | 8.7 | 7.0 | white |
| 7 | 7.9 | 0.180 | 0.49 | 5.2 | 0.051 | 36.0 | 157.0 | 0.99530 | 3.18 | 0.48 | 10.6 | 6.0 | white |
| 8 | 12.0 | 0.370 | 0.76 | 4.2 | 0.066 | 7.0 | 38.0 | 1.00040 | 3.22 | 0.60 | 13.0 | 7.0 | red |
| 9 | 5.6 | 0.210 | 0.40 | 1.3 | 0.041 | 81.0 | 147.0 | 0.99010 | 3.22 | 0.95 | 11.6 | 8.0 | white |
| fixed acidity | volatile acidity | citric acid | residual sugar | chlorides | free sulfur dioxide | total sulfur dioxide | density | pH | sulphates | alcohol | quality | wine_type | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3887 | 8.9 | 0.33 | 0.32 | 1.5 | 0.047 | 11.0 | 200.0 | 0.99540 | 3.19 | 0.46 | 9.40 | 5.0 | white |
| 3888 | 5.6 | 0.62 | 0.03 | 1.5 | 0.080 | 6.0 | 13.0 | 0.99498 | 3.66 | 0.62 | 10.10 | 4.0 | red |
| 3889 | 7.5 | 0.24 | 0.29 | 1.1 | 0.046 | 34.0 | 84.0 | 0.99020 | 3.04 | 0.39 | 11.45 | 6.0 | white |
| 3890 | 6.2 | 0.15 | 0.27 | 1.4 | 0.041 | 51.0 | 117.0 | 0.99090 | 3.28 | 0.38 | 11.20 | 6.0 | white |
| 3891 | 7.1 | 0.20 | 0.27 | 9.6 | 0.037 | 19.0 | 105.0 | 0.99444 | 3.04 | 0.37 | 10.50 | 7.0 | white |
| 3892 | 7.7 | 0.25 | 0.30 | 7.8 | 0.038 | 67.0 | 196.0 | 0.99555 | 3.10 | 0.50 | 10.10 | 5.0 | white |
| 3893 | 10.7 | 0.43 | 0.39 | 2.2 | 0.106 | 8.0 | 32.0 | 0.99860 | 2.89 | 0.50 | 9.60 | 5.0 | red |
| 3894 | 10.0 | 0.29 | 0.40 | 2.9 | 0.098 | 10.0 | 26.0 | 1.00060 | 3.48 | 0.91 | 9.70 | 5.0 | red |
| 3895 | 5.2 | 0.24 | 0.45 | 3.8 | 0.027 | 21.0 | 128.0 | 0.99200 | 3.55 | 0.49 | 11.20 | 8.0 | white |
| 3896 | 6.5 | 0.23 | 0.36 | 16.3 | 0.038 | 43.0 | 133.0 | 0.99924 | 3.26 | 0.41 | 8.80 | 5.0 | white |
Most frequently occurring
| fixed acidity | volatile acidity | citric acid | residual sugar | chlorides | free sulfur dioxide | total sulfur dioxide | density | pH | sulphates | alcohol | quality | wine_type | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 279 | 7.4 | 0.16 | 0.30 | 13.7 | 0.056 | 33.0 | 168.0 | 0.99825 | 2.90 | 0.44 | 8.7 | 7.0 | white | 6 |
| 157 | 6.8 | 0.18 | 0.30 | 12.8 | 0.062 | 19.0 | 171.0 | 0.99808 | 3.00 | 0.52 | 9.0 | 7.0 | white | 5 |
| 194 | 7.0 | 0.15 | 0.28 | 14.7 | 0.051 | 29.0 | 149.0 | 0.99792 | 2.96 | 0.39 | 9.0 | 7.0 | white | 5 |
| 283 | 7.4 | 0.19 | 0.31 | 14.5 | 0.045 | 39.0 | 193.0 | 0.99860 | 3.10 | 0.50 | 9.2 | 6.0 | white | 5 |
| 117 | 6.6 | 0.22 | 0.23 | 17.3 | 0.047 | 37.0 | 118.0 | 0.99906 | 3.08 | 0.46 | 8.8 | 6.0 | white | 4 |
| 138 | 6.7 | 0.16 | 0.32 | 12.5 | 0.035 | 18.0 | 156.0 | 0.99666 | 2.88 | 0.36 | 9.0 | 6.0 | white | 4 |
| 149 | 6.7 | 0.46 | 0.24 | 1.7 | 0.077 | 18.0 | 34.0 | 0.99480 | 3.39 | 0.60 | 10.6 | 6.0 | red | 4 |
| 278 | 7.4 | 0.16 | 0.27 | 15.5 | 0.050 | 25.0 | 135.0 | 0.99840 | 2.90 | 0.43 | 8.7 | 7.0 | white | 4 |
| 282 | 7.4 | 0.19 | 0.30 | 12.8 | 0.053 | 48.5 | 229.0 | 0.99860 | 3.14 | 0.49 | 9.1 | 7.0 | white | 4 |
| 1 | 5.0 | 0.33 | 0.16 | 1.5 | 0.049 | 10.0 | 97.0 | 0.99170 | 3.48 | 0.44 | 10.7 | 6.0 | white | 3 |